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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CTCGGGTAAC TTCTTGAGCG CGGCCACAGC AGCCTTGATC ATGAAGGCGA GCATGGTGAC 6 0 

CTTGACGCCG CTCTTTTCGT TCTCTTTGTT GAACTGCACG CGAAAGGCTT CCAGGTCGGT 12 0 

GATGTCCGCG TCGTCGTGGT TGGTGACGTG CGGGATGACC ACCCAGTTGC GGTGCAGGTT 18 0 

TTTCGATGGC ATAATATCTG CGTTGCGACG TGTAACACAC TATTGGAGAC ATATCATGCA 24 0 

AACGCTCAGC ATCCAGCACG GTACCCTCGT CACGATGGAT CAGTACCGCA GAGTCCTTGG 3 00 

GGATAGCTGG GTTCACGTGC AGGATGGACG GATCGTCGCG CTCGGAGTGC ACGCCGAGTC 360 

GGTGCCTCCG CCAGCGGATC GGGTGATCGA TGCACGCGGC AAGGTCGTGT TACCCGGTTT 420 

CAT C AATG C C CACACCCATG TGAACCAGAT CCTCCTGCGC GGAGGGCCCT CGCACGGACG 480 

TCAATTC TAT GACTGGCTGT TCAACGTTGT GTATCCGGGA CAAAAGGCGA TGAGACCGGA 54 0 

GGACGTAGCG GTGGCGGTGA GGTTGTATTG TGCGGAAGCT GTGCGCAGCG GGATTACGAC 60 0 

GATCAACGAA AACGCCGATT CGGCCATCTA CCCAGGCAAC ATCGAGGCCG CGATGGCGGT 660 

CTATGGTGAG GTGGGTGTGA GGGTCGTCTA CGCCCGCATG TTCTTTGATC GGATGGACGG 72 0 

GCGCATTCAA GGGTATGTGG ACGCCTTGAA GGCTCGCTCT CCCCAAGTCG AACTGTGCTC 78 0 

GATCATGGAG GAAACGGCTG TGGCCAAAGA TCGGATCACA GCCCTGTCAG ATCAGTATCA 84 0 

TGGCACGGCA GGAGGTCGTA TATCAGTTTG GCCCGCTCCT GCCACTACCA CGGCGGTGAC 900 

AGTTGAAGGA ATGCGATGGG CACAAGCCTT CGCCCGTGAT CGGGCGGTAA TGTGGACGCT 960 

TCACATGGCG GAGAG CGATC ATGATGAGCG GATTCATGGG ATGAGTCCCG CCGAGTACAT 1020 

GGAGTGTTAC GGACTCTTGG ATGAGCGTCT GCAGGTCGCG CATTGCGTGT ACTTTGACCG 1080 

GAAGGATGTT CGGCTGCTGC ACCGCCACAA TGTGAAGGTC GCGTCGCAGG TTGTGAGCAA 114 0 

TGCCTACCTC GGCTCAGGGG TGGCCCCCGT GCCAGAGATG GTGGAGCGCG GCATGGCCGT 12 00 

GGGCATTGGA ACAGATAACG GGAATAGTAA TGACTCCGCA AACATGATCG GAGACATGAA 1260 

GTTTATGGCC CATATTCACC GCGCGGTGCA TCGGGATGCG GACGTGCTGA CCCCAGAGAA 1320 
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GATTCTTGAA ATGGCGACGA TCGATGGGGC GCGTTCGTTG GGAATGGACC ACGAGATTGG . 138 0 

TTCCATCGAA ACCGGCAAGC GCGCGGACCT TATCCTGCTT GACCTGCGTC ACCTCAGACG 144 0 

ACTCTCACAT CATTTGGCGG CCACGATCGT GTTTCAGGCT TACGGCAATG AGGTGGACAC 1500 

TGTCCTGATT GACGGAAACG TTGTGATGGA GAACCGCCGC TTGAGCTTTC TTCCCCCTGA 156 0 

ACGTGAGTTG GCGTTCCTTG AGGAAGCGCA GAGCCGCGCC ACAGCTATTT TGCAGCGGGC 162 0 

GAACATGGTG GCTAACCCAG CTTGGCGCAG CCTCTAGGAA ATGACGCCGT TGCTGCATCC 1680 

GCCGCCCCTT GAGGAAATCG CTGCCATCTT GGCGCGGCTC GGATTGGGGG GCGGACATGA 174 0 

CCTTGATGGA TACAGAATTG CCATGAATGC GGCACTTCCG TCCTTCGCTC GTGTGGAATC 18 00 

GTTGGTAGGT GAGGGTCGAC TGCGGGCGCC AGCTTCCCGA AGAGGTGAAA GGCCCGAG 1858 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gin Thr Leu Ser lie Gin His Gly Thr Leu Val Thr Met Asp Gin 

15 10 15 

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gin Asp Gly Arg 
20 25 30 

lie Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp 
35 40 45 

Arg Val lie Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe lie Asn 
50 55 60 

Ala His Thr His Val Asn Gin lie Leu Leu Arg Gly Gly Pro Ser His 
65 70 75 80 

Gly Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gin 
85 90 * 95 

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys 
100 105 110 



Ala Glu Ala Val Arg Ser Gly lie Thr Thr lie Asn Glu Asn Ala Asp 
115 120 125 
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Ser Ala lie Tyr Pro Gly Asn lie Glu Ala Ala Met Ala Val Tyr Gly 
130 135 140 

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met 
145 150 155 160 

Asp Gly Arg He Gin Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro 
165 170 175 

Gin Val Glu Leu Cys Ser He Met Glu Glu Thr Ala Val Ala Lys Asp 
180 185 190 

Arg He Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly Gly Arg 
195 200 205 

lie Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu 
210 215' 220 

Gly Met Arg Trp Ala Gin Ala Phe Ala Arg Asp Arg Ala Val Met Trp 
225 230 235 240 

Thr Leu His Met Ala Glu Ser Asp His Asp Glu Arg He His Gly Met 
245 250 255 

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu 
260 265 270 

Gin Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu 
275 280 285 

His Arg His Asn Val Lys Val Ala Ser Gin Val Val Ser Asn Ala Tyr 
290 295 300 

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met 

305 310 315 320 



Ala Val Gly He 



Met He Gly Asp 
340 

Arg Asp Ala Asp 
355 

He Asp Gly Ala 
370 

Glu Thr Gly Lys 
385 

Arg Arg Leu Ser 



Gly Asn Glu Val 
420 



Gly Thr Asp Asn 
325 

Met Lys Phe Met 



Val Leu Thr Pro 
360 

Arg Ser Leu Gly 
375 

Arg Ala Asp Leu 
390 

His His Leu Ala 
405 

Asp Thr Val Leu 



Gly Asn Ser Asn 
330 

Ala His He His 
345 

Glu Lys He Leu 



Met Asp His Glu 
380 

He Leu Leu Asp 
395 

Ala Thr He Val 
410 

He Asp Gly Asn 
425 



Asp Ser Ala Asn 
335 

Arg Ala Val His 
350 

Glu Met Ala Thr 
365 

He Gly Ser He 



Leu Arg His Leu 
400 

Phe Gin Ala Tyr 
415 

Val Val Met Glu 
430 
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Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe Leu 
435 440 445 

Glu Glu Ala Gin Ser Arg Ala Thr Ala He Leu Gin Arg Ala Asn Met 
450 455 460 

Val Ala Asn Pro Ala Trp Arg Ser Leu 
465 470 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



CO 

;J! (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

id GCGAGCATGG TGACCTTGAC GCCGCTCTTT TCGTTCTCTT TGTTGAACTG CACGCGAAAG 6 0 

□ 

\J GCTTCCAGGT CGGTGATGTC CGCGTCGTCG TGGTTGGTGA CGTGCGGGAT GACCACCCAG 120 

i*3 TTGCGGTGCA GGTTTTTCGA TGGCATAATA TCTGCGTTGC GACGTGTAAC ACACTATTGG 180 

ft 

■5 3 AGACATATCA TGCAAACGCT CAGCATCCAG CACGGTACCC TCGTCACGAT GGATCAGTAC 24 0 
y 

z} CGCAGAGTCC TTGGGGATAG CTGGGTTCAC GTGCAGGATG GACGGATCGT CGCGCTCGGA 300 

S! * GTGCACGCCG AGTCGGTGCC TCCGCCAGCG GATCGGGTGA TCGATGCACG CGGCAAGGTC 36 0 

GTGTTACCCG GTTTCATCAA TGCCCACACC CATGTGAACC AGATCCTCCT GCGCGGAGGG 42 0 

CCCTCGCACG GGCGTCAATT CTATGACTGG CTGTTCAACG TTGTGTATCC GGGACAAAAG 480 

GCGATGAGAC CGGAGGACGT AGCGGTGGCG GTGAGGTTGT ATTGTGCGGA AGCTGTGCGC 540 

AGCGGGATTA CGACGATCAA CGAAAACGCC GATTCGGCCA TCTACCCAGG CAACATCGAG 60 0 

GCCGCGATGG CGGTCTATGG TGAGGTGGGT GTGAGGGTCG TCTACGCCCG CATGTTCTTT 660 

GATCGGATGG ACGGGCGCAT TCAAGGGTAT GTGGACGCCT TGAAGGCTCG CTCTCCCCAA 720 

GTCGAACTGT GCTCGATCAT GGAGGGAACG GCTGTGGCCA AAGATCGGAT CACAGCCCTG 780 

TCAGATCAGT ATCATGGCAC GGCAGGAGGT CGTATATCAG TTTGGCCCGC TCCTGCCACT 840 

ACCACGGCGG TGACAGTTGA AGGAATGCGA TGGGCACAAG CCTTCGCCCG TGATCGGGCG 900 

GTAATGTGGA CGCTTCACAT GGCGGAGAGC GATCATGATG AGCGGATTCA TGGGATGAGT 960 
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CCCGCCGAGT ACATGGAGTG TTACGGACTC TTGGATGAGC GTCTGCAGGT CGCGCATTGC .1020 

GTGTACTTTG ACCGGAAGGA TGTTCGGCTG CTGCACCGCC ACAATGTGAA GGTCGCGTCG 108 0 

CAGGTTGTGA GCAATGCCTA CCTCGGCTCA GGGGTGGCCC CCGTGCCAGA GATGGTGGAG 1140 

CGCGGCATGG CCGTGGGCAT TGGAACAGAT AACGGGAATA GTAATGACTC CGTAAACATG 1200 

ATCGGAGACA TGAAGTTTAT GGCCCATATT CACCGCGCGG TGCATCGGGA TGCGGACGTG 12 60 

CTGACCCCAG AGAAGATTCT TGAAATGGCG ACGATCGATG GGGCGCGTTC GTTGGGAATG 132 0 

GACCACGAGA TTGGTTCCAT CGAAACCGGC AAGCGCGCGG ACCTTATCCT GCTTGACCTG 138 0 

CGTCACCCTC AGACGACTCC TCAC CAT CAT TTGGCGGCCA CGATCGTGTT TCAGGCTTAC 144 0 

GGCAATGAGG TGGACACTGT CCTGATTGAC GGAAACGTTG TGATGGAGAA CCGCCGCTTG 15 00 

AGCTTTCTTC CCCCTGAACG TGAGTTGGCG TTCCTTGAGG AAGCGCAGAG CCGCGCCACA 1560 

GCTATTTTGC AGCGGGCGAA CATGGTGGCT AACCCAGCTT GGCGCAGCCT CTAGGAAATG 1620 

ACGCCGTTGC TGCATCCGCC GCCCCTTGAG GAAATCGCTG CCATCTTGGC GCGGCTCGGA 1680 

TTGGGGGGCG GACATGACCT TGATGGATAC AGAATTGCCA TGAATGCGGC ACTTCCGTCC 174 0 

TTCGCTCGTG TGGAATCGTT GGTAGGTGAG GGTCGACTGC GGGCGCCAGC TTCCCGAAGA 18 00 

AGTGAAAG 18 08 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GAGCGCCGCC ACAGCAGCCT TGATCATGAA GGCGAGCATG GTGACCTTGA CGCCGCTCTT 6 0 

TTCGTTCTCT TTGTTGAACT GCACGCGAAA GGCTTCCAGG TCGGTGATGT CCGCGTCGTC 12 0 

GTGGTTGGTG ACGTGCGGGA TGACCACCCA GTTGCGGTGC AGGTTTTTCG ATGGCGTAAT 180 

ATCTGCGTTG CGACGTGTAA CACACTATTG GAGACATATC ATGCAAACGC TCAGCATCCA 240 

GCACGGTACC CTCGTCACGA TGGATCAGTA CCGCAGAGTC CTTGGGGATA GCTGGGTTCA 3 00 

CGTGCAGGAT GGACGGATCG TCGCGCTCGG AGTGCACGCC GAGTCGGTGC CTCCGCCAGC 360 
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GGATCGGGTG ATCGATGCAC GCGGCAAGGT CGTGTTACCC GGTTTCATCA ATGCCCACAC 42 0 

CCATGTGAAC CAGATCCTCC TGCGCGGAGG GCCCTCGCAC GGGCGTCAAT TCTATGACTG 4 80 

GCTGTTCAAC GTTGTGTATC CGGGACAAAA GGCGATGAGA CCGGAGGACG TAGCGGTGGC 540 

GGTGAGGTTG TATTGTGCGG AAGCTGTGCG CAGCGGGATT ACGACGATCA ACGAAAACGC 600 

CGATTCGGCC ATCTACCCAG GCAACATCGA GGCCGCGATG GCGGTCTATG GTGAGGTGGG 660 

TGTGAGGGTC GTCTACGCCfc GCATGTTCTT TGATCGGATG GACGGGCGCA TTCAAGGGTA 72 0 

TGTGGACGCC TTGAAGGCTC GCTCTCCCCA AGTCGAACTG TGCTCGATCA TGGAGGAAAC 7 80 

GGCTGTGGCC AAAGATCGGA TCACAGCCCT GTCAGATCAG TATCATGGCA CGGCAGGAGG 84 0 

TCGTATATCA GTTTGGCCCG CTCCTGCCAC TACCACGGCG GTGACAGTTG AAGGAATGCG 900 

ATGGGCACAA GCCTTCGCCC GTGATCGGGC GGTAATGTGG ACGCTTCACA TGGCGGAGAG 960 

CGATCATGAT GAGCGGATTC ATGGGATGAG TCCCGCCGAT TACATGGAGT GTTACGGACT 102 0 

i 

ffj CTTGGATGAG CGTCTGCAGG TCGCGCATTG CGTGTACTTT GACCGGAAGG ATGTTCGGCT 1080 

f*j GCTGCACCGC CACAATGTGA AGGTCGCGTC GCAGGTTGTG AGCAATGCCT ACCTCGGCTC 114 0 

■I* AGGGGTGGCC CCCGTGCCAG AGATGGTGGA GCGCGGCATG GCCGTGGGCA TTGGAACAGA 12 00 

-« 

TAACGGGAAT AGTAATGACT CCGTAAACAT GATCGGAGAC ATGAAGTTTA TGG C C CAT AT 1260 

:;!! TCACCGCGCG GTGCATCGGG ATGCGGACGT GCTGACCCCA GAGAAGATTC TTGAAATGGC 132 0 

i y 

^ GACGATCGAT GGGGCGCGTT CGTTGGGGAT GGACCACGAG ATTGGTTCCA TCGAAACCGG 13 8 0 

^ CAAGCGCGCG GACCTTATCC TGCTTGACCT GCGTCACCCT CAGACGACTC CTCACCATCA 144 0 

TTTGGCGGCC ACGATCGTGT TTCAGGCTTA CGGCAATGAG GTGGACACTG TCCTGATTGA 1500 

CGGAAACGTT GTGATGGAGA ACCGCCGCTT GAGCTTTCTT CCCCCTGAAC GTGAGTTGGC 156 0 

GTTCCTTGAG GAAGCGCAGA GCCGCGCCAC AGCTATTTTG CAGCGGGCGA ACATGGTGGC 162 0 

TAACCCAGCT TGGCGCAGCC TCTAGGAAAT GACGCCGTTG CTGCATCCGC CGCCCCTTGA 168 0 

GG AAATCGCT GCCATCTTGG CGCGGCTCGG ATTGGGGGGC GGACATGACC TTGATGGATA 1740 

CAGAATTGCC ATGAATGCGG CACTTCCGTC CTTCGCTCGT GTGGAATCGT TGGTAGGTGA 180 0 

GGGTCGACTG CGGGCGCCAG CTTCCCGAAG AAGTGAAAGG CCCGAG 184 6 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 01 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Ala Ser Met Val Thr Leu Thr Pro Leu Phe Ser Phe Ser Leu Leu Asn 
15 10 15 

Cys Thr Arg Lys Ala Ser Arg Ser Val Met Ser Ala Ser Ser Trp Leu 
20 25 30 

Val Thr Cys Gly Met Thr Thr Gin Leu Arg Cys Arg Phe Phe Asp Gly 
35 40 45 

He He Ser Ala Leu Arg Arg Val Thr His Tyr Trp Arg His He Met 
50 55 60 

Gin Thr Leu Ser He Gin His Gly Thr Leu Val Thr Met Asp Gin Tyr 
65 70 75 80 

Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gin Asp Gly Arg He 
85 90 95 

Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp Arg 
100 105 110 

Val He Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe He Asn Ala 
115 120 125 

His Thr His Val Asn Gin He Leu Leu Arg Gly Gly Pro Ser His Gly 

130 135 140 

Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gin Lys 
145 150 155 160 

Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys Ala 
165 170 175 

Glu Ala Val Arg Ser Gly He Thr Thr He Asn Glu Asn Ala Asp Ser 
180 185 190 

Ala He Tyr Pro Gly Asn He Glu Ala Ala Met Ala Val Tyr Gly Glu 
195 200 205 

Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met Asp 
210 215 220 

Gly Arg He Gin Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro Gin 
225 230 235 240 

Val Glu Leu Cys Ser He Met Glu Gly Thr Ala Val Ala Lys Asp Arg 
245 250 255 



•J 
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He Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly Gly Arg He. 
260 265 270 

Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu Gly 
275 280 285 

Met Arg Trp Ala Gin Ala Phe Ala Arg Asp Arg Ala Val Met Trp Thr 
290 295 300 

Leu His Met Ala Glu Ser Asp His Asp Glu Arg He His Gly Met Ser 
305 310 315 320 

Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu Gin 
325 330 335 

Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu His 
340 345 350 

Arg His Asn Val Lys Val Ala Ser Gin Val Val Ser Asn Ala Tyr Leu 
355 360 365 

Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met Ala 
370 375 380 

Val Gly He Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn Met 
385 390 395 400 

He Gly Asp Met Lys Phe Met Ala His He His Arg Ala Val His Arg 
405 410 415 

Asp Ala Asp Val Leu Thr Pro Glu Lys He Leu Glu Met Ala Thr He 
420 425 430 

Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu He Gly Ser He Glu 

435 440 445 

Thr Gly Lys Arg Ala Asp Leu He Leu Leu Asp Leu Arg His Pro Gin 
450 455 460 

Thr Thr Pro His His His Leu Ala Ala Thr He Val Phe Gin Ala Tyr 
465 470 475 480 

Gly Asn Glu Val Asp Thr Val Leu He Asp Gly Asn Val Val Met Glu 
485 490 495 

Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe Leu 
500 505 510 

Glu Glu Ala Gin Ser Arg Ala Thr Ala He Leu Gin Arg Ala Asn Met 
515 520 525 

Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu His 
530 535 540 

Pro Pro Pro Leu Glu Glu He Ala Ala Ile*Leu Ala Arg Leu Gly Leu 
545 550 555 560 
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Gly Gly Gly His Asp Leu Asp Gly Tyr Arg He Ala Met Asn Ala Ala. 

565 570 575 

Leu Pro Ser Phe Ala Arg Val Glu Ser Leu Val Gly Glu Gly Arg Leu 
580 585 590 

Arg Ala Pro Ala Ser Arg Arg Ser Glu 
595 600 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 614 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Ser Ala Ala Thr Ala Ala Leu He Met Lys Ala Ser Met Val Thr Leu 
15 10 15 

Thr Pro Leu Phe Ser Phe Ser Leu Leu Asn Cys Thr Arg Lys Ala Ser 
20 25 30 

Arg Ser Val Met Ser Ala Ser Ser Trp Leu Val Thr Cys Gly Met Thr 
35 40 45 

Thr Gin Leu Arg Cys Arg Phe Phe Asp Gly Val He Ser Ala Leu Arg 

50 55 60 

Arg Val Thr His Tyr Trp Arg His He Met Gin Thr Leu Ser He Gin 
65 70 75 80 

His Gly Thr Leu Val Thr Met Asp Gin Tyr Arg Arg Val Leu Gly Asp 
85 90 95 

Ser Trp Val His Val Gin Asp Gly Arg He Val Ala Leu Gly Val His 
100 105 HO 

Ala Glu Ser Val Pro Pro Pro Ala Asp Arg Val He Asp Ala Arg Gly 
115 120 125 

Lys Val Val Leu Pro Gly Phe He Asn Ala His Thr His Val Asn Gin 
130 135 140 

He Leu Leu Arg Gly Gly Pro Ser His Gly Arg Gin Phe Tyr Asp Trp 
145 150 155 160 

Leu Phe Asn Val Val Tyr Pro Gly Gin Lys Ala Met Arg Pro Glu Asp 
165 170 175 
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Val Ala Val Ala Val Arg Leu Tyr Cys Ala Glu Ala Val Arg Ser Gly 
180 185 190 

He Thr Thr He Asn Glu Asn Ala Asp Ser Ala He Tyr Pro Gly Asn 
195 200 205 

He Glu Ala Ala Met Ala Val Tyr Gly Glu Val Gly Val Arg Val Val 
210 215 220 

Tyr Ala Arg Met Phe Phe Asp Arg Met Asp Gly Arg He Gin Gly Tyr 
225 230 235 240 

Val Asp Ala Leu Lys Ala Arg Ser Pro Gin Val Glu Leu Cys Ser He 
245 250 255 

Met Glu Glu Thr Ala Val Ala Lys Asp Arg He Thr Ala Leu Ser Asp 
260 265 270 

Gin Tyr His Gly Thr Ala Gly Gly Arg He Ser Val Trp Pro Ala Pro 
275 280 285 

Ala Thr Thr Thr Ala Val Thr Val Glu Gly Met Arg Trp Ala Gin Ala 
290 295 300 

Phe Ala Arg Asp Arg Ala Val Met Trp Thr Leu His Met Ala Glu Ser 
305 310 315 320 

Asp His Asp Glu Arg He His Gly Met Ser Pro Ala Asp Tyr Met Glu 
325 330 335 

Cys Tyr Gly Leu Leu Asp Glu Arg Leu Gin Val Ala His Cys Val Tyr 
340 345 350 

Phe Asp Arg Lys Asp Val Arg Leu Leu His Arg His Asn Val Lys Val 

355 360 365 

Ala Ser Gin Val Val Ser Asn Ala Tyr Leu Gly Ser Gly Val Ala Pro 
370 375 380 

Val Pro Glu Met Val Glu Arg Gly Met Ala Val Gly He Gly Thr Asp 
385 390 395 400 

Asn Gly Asn Ser Asn Asp Ser Val Asn Met He Gly Asp Met Lys Phe 
405 410 415 



Met Ala His He His Arg Ala Val His Arg Asp Ala Asp Val Leu Thr 
420 425 430 

Pro Glu Lys He Leu Glu Met Ala Thr He Asp Gly Ala Arg Ser Leu 
435 440 445 

Gly Met Asp His Glu He Gly Ser He Glu Thr Gly Lys Arg Ala Asp 
450 455 460 
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Leu He Leu Leu Asp Leu Arg His Pro Gin Thr Thr Pro His His His 
465 470 475 480 

Leu Ala Ala Thr He Val Phe Gin Ala Tyr Gly Asn Glu Val Asp Thr 
485 490 495 

Val Leu He Asp Gly Asn Val Val Met Glu Asn Arg Arg Leu Ser Phe 
500 505 510 

Leu Pro Pro Glu Arg Glu Leu Ala Phe Leu Glu Glu Ala Gin Ser Arg 
515 520 525 

Ala Thr Ala He Leu Gin Arg Ala Asn Met Val Ala Asn Pro Ala Trp 
530 535 540 

Arg Ser Leu Glu Met Thr Pro Leu Leu His Pro Pro Pro Leu Glu Glu 
545 550 555 560 

He Ala Ala He Leu Ala Arg Leu Gly Leu Gly Gly Gly His Asp Leu 
Q 565 570 575 

lO Asp Gly Tyr Arg He Ala Met Asn Ala Ala Leu Pro Ser Phe Ala Arg 

i;n 580 585 590 

ijt Val Glu Ser Leu Val Gly Glu Gly Arg Leu Arg Ala Pro Ala Ser Arg 

595 600 605 

Arg Ser Glu Arg Pro Glu 
610 



r y 
in 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGGTATCGGG GAATTC TTGA GCGCGGCCAC AGCAGCCNTG ATCATGAAGG CGAGCATGGT 60 

GACCTNGACG CCGTNTTTTN GTTNTTTTTT GTTGAACTGC ACGCGAAAGG TTCCAGGTCG 120 

GTGATGTCCG CGTCGTCGTG GTTGGTGACG TGCGGGATGA CCACCCAGNT GCGGTGCAGG 180 

TTTTTCGATG GCATAATATC TGCGTTGCGA CGTGTAACAC ACTANTGGAG ACATATCATG 240 

CAAACGCTCA GCATCCAGCA CGGTACCCTC GTCACGATGG ATCAGTACCG CAGAGTCCTT 300 

GGGGATAGCT GGGTTCACGT GCAGGATGGA CGGATCGTCG CGCTCGGAGT GCACGCCGAG 360 



/ 
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TCGGTGCCTC CGC CAGCGGA TCGGGTGATC GATGCACGCG GCAAGGTCGT GTTACCCGGT 420 
TTCATCAATG CCCACACCCA TGTGAACCAG ATCCTCCTGC GCGGAGGGCC CTCGCACGGG 480 
CGTCAATTCT ATGACTGGCT GTTCAACGTT GTGTATCCGG GACAAAAGGC GATGAGACCG 540 
GAGGA 545 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

'sit? 

CO (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

iin CCTGCGCGGA GGGCCTCCGC ACGGGCGTCA ATTCTATGAC TGGCTGTTCA ACGTTGTGTA 60 

Q TCCGGGACAA AAGGCGATGA GACCGGAGGA CGTAGCGGTG GCGGTGAGGT TGTATTGTGC 120 

'""4 

; S GGAAGCTGTG CGCAGCGGGA TTACGACGAT CAACGAAAAC GCCGATTCGG CCATCTACCC 18 0 

jjj- AGGCAACATC GAGGCCGCGA TGGCGGTCTA TGGTGAGGTG GGTGTGAGGG TCGTCTACGC 240 

]f| CCGCATGTTC TTTGATCGGA TGGACGGGCG CATTCAAGGG TATGTGGACG CCTTGAAGGC 3 00 

■I s "? TCGCTCTCCC CAAGTCGAAC TGTGCTCGAT CATGGAGGAA ACGGCTGTGG CCAAAGATCG 36 0 

GATCACAGCC CTGTCAGATC AGTATCATGG CACGGCAGGA GGTCCTATAT CAGTTTGGCC 420 

CGCTCCTGCC ACTACCACGG CGGTGACATT TAAANGAATC CATGGGCCAA CCTCCCCCGT 4 80 

GATCCGGCGG TAATGTGAC 4 99 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
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TNGCAGGTTG TGAGCATGCT ACTTCGGTTC AGGNGTGGCC CCCGTGCCAG AGATGGTGGA 
GCGCGGCATG G CCGTGGGC A TTGGAACAGA TAACGGGAAT AGTAATGACT CCGTAAACAT 
GATCGGAGAC ATGAAGTTTA TGGCCCATAT TCACCGCGCG GTGCATCGGG ATGCGGACGT 
GCTGACCCCA GAGAAGATTN TTGAAATGGC GACGATCGAT GGGGCGCGTT TCGTTGGGGA 
TGGACCACGA GATTGGTTCC ATCGAAACCG GCAAGCGCGC GGACCTTATC CTGCTTGACC 
TGCGTCACCC TCAGACGACT CCTCACCATC ATTTGGCGGC CACGATCGTG TTTCAGGCTT 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGGCCACGAT CGTGTTTCAG GCTTACGGCA ATGAGGTGGA CACTGTCCTG ATTGACGGAA 
ACGTTGTGAT GGAGAACCGC CGCTTGAGCT TTCTTCCCCC TGAACGTGAG TTGGCGTTCC 
TTGAGGAAGC GCAGAGCCGC GCCACAGCTA TTTTGCATCG GGCGAAACAT GGTGGCTAAC 
CCAGCTTGGC GCAGCCTCTA GGAAATGACG CCGTTGCTGC ATCCGCCGCC CCTTGAGGAA 
ATCGCTGCCA TCTTGGCGCG GCTCGGATTG GGGGGCGGAC ATGAC CTTG A TGGATACAGA 
ATTGCCATGA ATGCGGCACT TCCGTCCTTC GCTCGTGTGG AATCGTTGGT AGGTGAGGGT 
CGACTGCGGG CGCCAGCTTC CCGAAGAGGT GAAAGCCCGA GGATCCTCTA GAGTCCGATT 
TTTCCGATGT CATCACCGGC GCG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



'A 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
CCTGCGCGGA GGCCTCCGCA CGGGCGTCAA TTCTATGACT GGCTGTTCAA CGTTGTGTAT 
CCGGGACAAA AGGCGATGAG ACCGGAGGAC GTANCGGTGG CGGTGAGGTT GTATTGTGCG 
GAAGCTGTGC GCAGCGGGAT TACGACGATC AACGAAAACG CCGATTCGGC CATCTACCCA 
GGCAACATCG AGGCCGCGAT GGCGGTCTAT GGTGAGGTGG GTGTGAGGGT CGTCTACGCC 
CGCATGTTCT TTGAT CGGAT GGACGGGCGC ATTCAAGGGT ATGTGGACGC CTTGAAGGCT 
CGCTCTCCCC AAGTCGAACT GTGCTCGATC ATGGAGGAAA CGGCTGTGGC CAAAGATCGG 
ATCACANCCC TGTCAGATCA NTATCATGGC ACGGCANGAG GTCCTATATC ANTTTGGCCC 
GCTCCTGCCA CTACCACNGC GGTGACATTT NAANGAATTC CATNGGCACA ACCTTCCCCC 
GTGATCNGGC GGTAATGTNG ACCCA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DEI 

Pro His Gly Arg 
1 

Gly Gin Lys Ala 
20 

Tyr Cys Ala Glu 
35 

Ala Asp Ser Ala 
50 

Tyr Gly Glu Val 
65 

Arg Met Asp Gly 



Ser Pro Gin Val 
100 



1CRIPTION: SEQ I] 

Gin Phe Tyr Asp 
5 

Met Arg Pro Glu 



Ala Val Arg Ser 
40 

lie Tyr Pro Gly 
55 

Gly Val Arg Val 
70 

Arg lie Gin Gly 
85 

Glu Leu Cys Ser 



) NO: 12 : 

Trp Leu Phe Asn 
10 

Asp Val Ala Val 
25 

Gly lie Thr Thr 



Asn lie Glu Ala 
60 

Val Tyr Ala Arg 
75 

Tyr Val Asp Ala 
90 

lie Met Glu Glu 
105 



Val Leu Tyr Pro 
15 

Ala Val Arg Leu 
30 

lie Asn Glu Asn 
45 

Ala Met Ala Val 



Met Phe Phe Asp 
80 

Leu Lys Ala Arg 
95 

Thr Ala Val Ala 
110 
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Lys Asp Arg He Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly 
115 120 125 



Gly Arg He Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Ser His Gly Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Leu Tyr Pro 
15 10 15 

Gly Gin Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu 
20 25 30 



□ Tyr Cys Ala Glu Ala Val Arg Ser Gly He Thr Thr He Asn Glu Asn 

! a H 35 40 45 

• y 

i f! Ala Asp Ser Ala He Tyr Pro Gly Asn He Glu Ala Ala Met Ala Val 

50 55 60 

Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp 
65 70 75 80 

Arg Met Asp Gly Arg He Gin Gly Tyr Val Asp Thr Leu Lys Ala Arg 
85 90 95 

Ser Pro Gin Val Glu Leu Cys Ser lie Met Glu Glu Thr Ala Val Ala 
100 105 110 

Lys Asp Arg He Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly 
115 120 125 

Gly Arg He Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 



'J 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 14: 

Pro His Gly Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro 
1 5 10 15 

Gly Gin Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu 
20 25 30 

Tyr Cys Ala Glu Ala Val Arg Ser Gly lie Thr Thr lie Asn Glu Asn 
35 40 45 

Ala Asp Ser Ala lie Tyr Pro Gly Asn He Glu Ala Ala Met Ala Val 
50 55 60 

Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp 
65 70 75 80 

Arg Met Asp Gly Arg He Gin Gly Tyr Val Asp Ala Leu Lys Ala Arg 
85 90 95 

Ser Pro Gin Val Glu Leu Cys Ser He Met Glu Glu Thr Ala Val Ala 
100 105 110 

Lys Asp Arg He Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly 
115 120 125 

Gly Arg He Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr 

130 135 140 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ser His Gly Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Leu Tyr Pro 
15 10 15 



J 
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Gly Gin Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu 
20 25 30 

Tyr Cys Ala Glu Ala Val Arg Ser Gly lie Thr Thr lie Asn Glu Asn 
35 40 45 

Asn Ala Asp Ser Ala lie Tyr Pro Gly Asn lie Glu Ala Ala Met Ala 
50 55 60 

Val Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe 
65 70 75 80 

Asp Arg Met Asp Gly Arg lie Gin Gly Tyr Val Asp Thr Leu Lys Ala 
85 90 95 

Arg Ser Pro Gin Val Glu Leu Cys Ser lie Met Glu Glu Thr Ala Val 
100 105 110 

Ala Lys Asp Arg lie Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala 
115 120 125 

Gly Gly Arg lie Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val 
130 135 140 

Thr 
145 

INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Ser His Gly Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro 
15 10 15 

Gly Gin Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu 
20 25 30 

Tyr Cys Ala Glu Ala Val Arg Ser Gly lie Thr Thr lie Asn Glu Asn 
35 40 45 

Ala Asp Ser Ala lie Tyr Pro Gly Asn He Glu Ala Ala Met Ala Val 
50 55 60 



Tyr Gly Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp 
65 70 75 > 80 



J 



0 
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Arg Met Asp Gly Arg lie Gin Gly Tyr Val Asp Ala Leu Lys Ala Arg 
85 90 95 

Ser Pro Gin Val Glu Leu Cys Ser lie Met Glu Glu Thr Ala Val Ala 
100 105 110 



Lys Asp Arg He Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly 
115 120 125 

Gly Arg He Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr 
130 135 140 



(2) INFORMATION FOR SEQ ID NO : 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGCGAAAGGC TTCCAGGTCG GTGATGTCCG CGTCGTCGTG GTTGGTGACG TG CGGGATGA 60 

CCACCCAGTC GCGGTGCAGG TTTTTCGATG GCATAATATC TGCGTTGCGA CGTGTAACAC 12 0 

ACTATTGGAG ACATATCATG CAAACGCTCA GCATCCAGCA CGGTACCCTC GTCACGATGG 18 0 

ATCAATACCG CAGAGTCCTT GGGGATAGCT GGGTTCACGT GCAGGATGGA CGGATCGTCG 24 0 

CGCTCGGAGT GCACGCCAAG TCGGTGCCTC CGCCAGCGGA TCGGGTGATC GATGCACGCG 300 

GCAAGGTCGT GTTACCCGGT TTCATCAATG CCCACACCCA TGTGAACCAG ATCCTCCTGC 360 

GCGGAGGGCC CTCGCACGGG CGTCAATTCT ATGACTGGCT GTTCAACGTT GTGTATCCGG 42 0 

GACAAAAGGC GATGAGACCG GAGGACGTAG CGGTGGCGGT GAGGTTGTAT TGTGCGGAAG 480 

CTGTGCGCAG CGGGATTACG ACGATCAACG AAAACGCCGA TTCGGCCATC TACCCAGGCA 540 

ACATCGAGGC CGCGATGGCG GTCTATGGTG AGGTGGGTGT GAGGGTCGTC TACGCCCGCA 600 

TGTTCTTTGA TCGGATGGAC GGGCGCATTC AAGGGTATGT GGACGCCTTG AAGGCTCGCT 660 

CTCCCCAAGT CGAACTGTGC TCGATCATGG AGGAAACGGC TGTGGCCAAA GATCGGATCA 720 
CAGCCCTGTC AGATCAGTAT CATGGCACGG CAGGAGGTCG TATATCAGTT TGGCCCGCTC . 780 

CTGCCACTAC CACGGCGGTG ACAGTTGAAG GAATGCGATG GGCACAAGCC TTCGCCCGTG . ,840 
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ATCGGGCGGT AATGTGGACG 
GGATGAGTCC CGCCGAGTAC 
CGCATTGCGT GTACTTTGAC 
TCGCGTCGCA GGTTGTGAGC 
TGGTGGAGCG CGGCATGGCC 
TAAACATGAT CGGAGACATG 
CGGACGTG CT GACCCCAGAG 
TGGGGATGGA CCACGAGATT 
TTGACCTGCG TCACCCTCAG 
AGGCTTACGG CAATGAAGTG 
GCTGCTTGAG CTTTCTTCCC 
GCGCCACAGC TATTTTGCAG 
AGGAAATGAC GCCGTTGCTG 
GGCTCGGATT GGG 
(2) INFORMATION FOR SEQ ID NO: 18: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TCGTGGTTGG TGACGTGCGG GATGACCACC CAGTCGCGGT GCAGGTTTTT CGATGGCATA 
ATATCTGCGT TGCGACGTGT AACACACTAT TGGAGACATA TCATGCAAAC GCTCAGCATC 
CAGCACGGTA CCCTCGTCAC GATGGATCAG TACCGCAGAG TCCTTGGGGA TAGCTGGGTT 
CACGTGCAGG ATGGACGGAT CGTCGCGCTC GGAGTGCACG CCGAGTCGGT GCCTCCGCCA 
GCGGATCGGG TGATCGATGC ACGCGGCAAG GTCGTGTTAC CCGGTTTCAT CAATGCCCAC 
ACCCATGTGA ACCAGATCCT CCTGCGCGGA GGGCCCTCGC ACGGGCGTCA ATTCTATGAC 
TGGCTGTTCA ACGTTGTGTA TCCGGGACAA AAGGCGATGA GACCGGAGGA CGTAGCGGTG 



CTTCACATGG CGGAGAGCGA 
ATGGAGTGTT ACGGACTCTT 
CGGAAGGATG TTCGGCTGCT 
AATGCCTACC TCGGCTCAGG 
GTGGGCATTG GAACAGATAA 
AAGTTTATGG CCCATATTCA 
AAGATTCTTG AAATGGCGAC 
GGTTCCATCG AAACCGGCAA 
ACGACTCCTC ACCATCATTT 
GACACTGTCC TGATTGACGG 
CCTGAACGTG AGTTGGCGTT 
CGGGCGAACA TGGTGGCTAA 
CATCCGCCGC CCCTTGAGGA 



TCATGATGGG CGGATTCATG 
GGATGAGCGT CTGCAGGTCG 
GCACCGCCAC AATGTGAAGG 
GGTGGCCCCC GTGCCAGAGA 
CGGGAATAGT AATGACTCCG 
CCGCGCGGTG CATCGGGATG 
GATCGATGGG GCGCGTTCGT 
GCGCGCGGAC CTTATCCTGC 
GGCGGCCACG ATCGTGTTTC 
AAACGTTGTG ATGGAGAACC 
CCTTGAGGGA GCGCAGAGCC 
CCCAGCTTGG CGCAGCCTCT 
AATCGCTGCC ATCTTGGCGC 
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GCGGTGAGGT TGTATTGTGC GGAAGCTGTG CGCAGCGGGA TTACGACGAT CAACGAAAAC 480 

GCCGATTCGG CCATCTACCC AGGCAACATC GAGGCCGCGA TGGCGGTCTA TGGTGAGGTG 540 

GGTGTGAGGG TCGTCTACGC CCGCATGTTC TTTGATCGGA TGGACGGGCG CATTCAAGGG 600 

TATGTGGACG CCTTGAAGGC TCGCTCTCCC CAAGTCGAAC TGTGCTCGAT CATGGAGGAA 660 

ACGGCTGTGG CCAAAGATCG GATCACAGCC CTGTCAGATC AGTATCATGG CACGGCAGGA 720 

GGTCGTATAT CAGTTTGGCC CGCTCCTGCC ACTACCACGG CGGTGACAGT TGAAGGAATG 780 

CGATGGGCAC AAGCCTTCGC CCGTGATCGG GCGGTAATGT GGACGCTTCA CATGGCGGAG 840 

AGCGATCATG ATGAGCGGAT TCATGGGATG AGTCCCGCCG AGTACATGGA GTGTCACGGA 900 

CTCTTGGATG AGCGTCTGCA GGTCGCGCAT TGCGTGTACT TTGACCGGAA GGATGTTCGG 960 

CTGCTGCACC GCCACAATGT GAAGGTCGCG TCGCAGGTTG TGAGCAATGC CTACCTCGGC 102 0 

v3 TCAGGGGTGG CCCCCGTGCC AGAGATGGTG GAGCGCGGCA TGGCCATGGG CATTGGAACA 1080 

: ; IJ 

Cf! GATAACGGGA ATAGTAATGA CTCCGTAAAC ATGATCGGAG ACATGAAGTT TATGGCCCAT 114 0 

! a J ATTCACCGCG CGGTGCATCG GGATGCGGAC GTGCTGACCC CAGAGAAGAT TCTTGAAATG 1200 

\l GCGACGATCG ATGGGGCGCG TTCGTTGGGA ATGGACCACG AGATTGGTTC CATCGAAACC 1260 

□ GGCAAGCGCG CGGACCTTAT CCTGCTTGAC CTGCGTCACC CTCAGACGAC TCCTCACCAT 132 0 

i'Sj CATTTGGCGG CCACGATCGT GTTTCAGGCT TACGGCAATG AGGTGGACAC TGTCCTGATT 138 0 

: is? 
I ft 

GACGGAAACG TTGTGATGGA GAACCGCCGC TTGAGCTTTC TTCCCCCTGA ACGTGAGTTG 1440 

'"*** GCGTTCCTTG AGGAAGCGCA GAGCCGCGCC ACAGCTATTT TGCAGCGGGC GAACATGGTG 150 0 

GCTAACCCAG CTTGGCGCAG CCTCTAGGAA ATGACGCCGT TGCTGCATCC GCCGCCCCTT 1560 

GAGGAAATCG CTGCCATCTT GGCGCGGCTC GGATTGGG 1598 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1586 base pairs 
y " (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ACGTGCGGGA TGACCACCCA GTTGCGGTGC AGGTTTTTCG ATGGCGTAAT ATCTGCGTTG 



60 
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CGACGTGTAA CACACTATTG GAGACATATC ATGCAAACGC TCAGCATCCA GCACGGTACC 
CTCGTCACGA TGGATCAGTA CCGCAGAGTC CTTGGGGATA GCTGGGTTCA CGTGCAGGAT 
GGACGGATCG TCGCGCTCGG AGTGCACGCC GAGTCGGTGC CTCCGCCAGC GGATCGGGTG 
ATCGATGCAC GCGGCAAGGT CGTGTTACCC GGTTTCATCA ATGCCCACAC CCATGTGAAC 
CAGATCCTCC TGCGCGGAGG GCCCTCGCAC GGGCGTCAAT TCTATGACTG GCTGTTCAAC 
GTTGTGTATC CGGGACAAAA GGCGATGAGA CCTGAGGACG TAGCGGTGGC GGTGAGGTTG 
TATTGTGCGG AAGCTGTGCG CAGCGGGATT ACGACGATCA ACGAAAACGC CGATTCGGCC 
ATCTACCCAG GCAACATCGA GGCCGCGATG GCGGTCTATG GTGAGGTGGG TGTGAGGGTC 
GTCTACGCCC GCATGTTCTT TGATCGGATG GACGGGCGCA TTCAAGGGTA TGTGGACGCC 
TTGAAGGCTC GCTCTCCCCA AGTCGAACTG TGCTCGATCA TGGAGGAAAC GGCTGTGGCC 
AAAGATCGGA TCACAGCCCT GTCAGATCAG TATCATGGCA CGGCAGGAGG TCGTATATCA 
GTTTGGCCCG CTCCTGCCAC TACCACGGCG GTGACAGTTG AAGGAATGCG ATGGGCACAA 
GCCTTCGCCC GTGATCGGGC GGTAATGTGG ACGCTTCACA TGGCGGAGAG CGATCATGAT 
GAGCGGATTC- ATGGGATGAG TCCCGCCGAG TACATGGAGT GTTACGGACT CTTGGATGAG 
CGTCTGCAGG TCGCGCATTG CGTGTACTTT GACCGGAAGG ATGTTCGGCT GCTGCACCGC 
CACAATGTGA AGGTCGCGTC GCAGGTTGTG AGCAATGCCT ACCTCGGCTC AGGGGTGGCC 
CCCGTGCCAG AGATGGTGGA GCGCGGCATG GCCGTGGGCA TTGGAACAGA TAACGGGAAT 
AGTAATGACT CCGTAAACAT GATCGGAGAC ATGAAGTTTA TGGCCCATAT TCACCGCGCG 
GTGCATCGGG ATGCGGACGT GCTGACCCCA GAGAAGATTC TTGAAATGGC GACAATCGAT 
GGGGCGCGTT CGTTGGGAAT GGACCACGAG ATTGGTTCCA TCGAAACCGG CAAGCGCGCG 
GACCTTATCC TGCTTGACCT GCGTCACCCT CAGACGACTC CTCACCATCA TTTGGCGGCC 
ACGATCGTGT TTCAGGCTTA CGGCAATGAG GTGGACACTG TCCTGATTGA CGGAAACGTT 
GTGATGGAGA ACCGCCGCTT GAGCTTTCTT CCCCCTGAAC GTGAGTTGGC GTTCCTTGAG 
GAAGCGCAGA GCCGCGCCAC AGCTATTTTG CAGCGGGCGA ACATGGTGGC TAACCCAGCT 
TGGCGCAGCC TCTAGGAAAT GACGCCGTTG CTGCATCCGC TGCC CCTTG A GGAAATCGCT 
GCCATCTTGG CGCGGCTCGG ATTGGG 
(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1597 base pairs 
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(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CGTGGTTGGT GACGTGGGGG ATGACCACCC AGTCGCGGTG CAGGTTTTTC GATGGCATAA 6 0 

TATCTGCGTT GCGACGTGTA ACACACTATT GGAGACATAT CATGCAAACG CTCAGCATCC 12 0 

AGCACGGTAC CCTCGTCACG ATGGATCAGT AC CGCAG AGT CCTTGGGGAT AGCTGGGTTC 180 

ACGTGCAGGA TGGACGGATC GTCG CGCTCG GAGTGCACGC CGAGTCGGTG CCTCCGCCAG 240 

CGGATCAGGT GATCGATGCA CGCGGCAAGG TCGTGTTACC CGGTTTCATC AATGCCCACA 3 00 

CCCATGTGAA CCAGATCCTC CTGCGCGGAG GGCCCTCGCA CGGGCGTCAA TTCCATGACT 360 

GGCTGTTCAA CGTTGTGTAT CCGGGACAAA AGGCGATGAG ACCGGAGGAC GTAGCGGTGG 420 

CGGTGAGGTT GTATTGTGCA GAAGCTGTGC GCAGCGGGAT TACGACGATT AACGAAAACG 480 

CCGATTCGGC CATCTACCCA GGCAACATCG AGGCCGCGAT GGCGGTCTAT GGTGAGGTGG 54 0 

GTGTGAGGGT CGTCTACGCC CGCATGTTCT TTGATCGGAT GGACGGGCGC ATTCAAGGGT 600 

ATGTGGACGC CTTGAAGGCT CGCTCTCCCC AAGTCGAACT GTGCTCGATC ATGGAGGAAA 66 0 

CGGCTGTGGC CAAAGATCGG ATCACAGCCC TGTCAGATCA GTATCATGGC ACGGCAGGAG 720 

GTCGTATATC AGTTTGGCCC GCTCCTGCCA CTACCACGGC GGTGACAGTT GAAGGAATGC 780 

GATGGGCACA AGCCTTCGCC CGTGATCGGG CGGTAATGTG GACGCTTCAC ATGGCGGAGA 840 

GCGATCATGA TGGGCGGATT CATGGGATGA GTCCCGCCGA GTACATGGAG TGTTACGGAC 900 

TCTTGGATGA GCGTCTGCAG GTCGCGCATT GCGTGTACTT TGACCGGAAG GATGTTCGGC 960 

TGCTGCACCG CCACAATGTG AAGGTCGCGT CGCAGGTTGT GAGCAATGCC TACCTCGGCT 1020 

CAGGGGTGGC CCCCGTGCCA GAGATGGTGG AGCGCGGCAT GGCCGTGGGC ATTGGAACAG 1080 

ATAACGGGAA TAGTAATGAC TCCGTAAACA TGATCGGAGA CATGAAGTTT ATGGCCCATA 1140 

TTCACCGCGC GGTGCATCGG GATGCGGACG TGCTGACCCC AGAGAAGATT CTTGAAATGG 1200 

CAACGATCGA TGGGGCGCGT TCGTTGGGAA TGGACCACGA GATTGGTTCC ATCGAAACCG 1260 

GCAAGCGCGC GGACCTTATC CTGCTTGACC TGCGTCACCC TCAGACGACT CCTCACCATC 1320 

ATTTGGCGGC CACGATCGTG TTTCAGGCTT ACGGCAATGA GGTGGACACT GTCCTGATTG 1380 
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ACGGAAACGT TGTGATGGAG AACCGCCGCT TGAGCTTTCT TCCCCCTGAA CGTGAGTTGG 
CGTTCCTTGA GGAAGCGCAG AGCCGCGCCA CAGCTATTTT GCAGCGGGCG AACATGGTGG 
CTAACCCAGC TTGGCGCAGC CTCTAGGAAA TGACGCCGTT GCTGCATCCG CCGCCCCTTG 
AGGAAATCGC TGCCATCTTG GCGCGGCTCG GATTGGG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTGACCTTGA CGCCGCTCTT TTCGTTCTCT TTGTTGAACT GCACGCGAAT GGCTTCCAGT 
TCGATGATGT CCGCGTCGTC GTGGTTGGTG ACGTGCGGGA TGACCACCCA GTCGCGGTGC 
AGGTTTTTCG ATGGCATAAT ATCTGCGTTG CGACGTGTAA CACACTATTG GAGACATATC 
ATGCAAACGC TCAGCATCCA GCACGGTACC CTCGTCACGA TGGATCAGTA CCGCAGAGTC 
CTTGGGGATA GCTGGGTTCA CGTGCAGGAT GGACGGATCG TCGCGCTCGG AGTGCACGCC 
GAGTCGGTGC CTCCGCCAGC GGATCGGGTG ATTGATGCAC GCGGCAAGGT CGTGTTACCC 
GGTTTCATCA ATGCCCACAC CCATGTGAAC CAGATCCTCC TGCGCGGAGG CCTCGCACGG 
GCGTCAATTC TATGACTGGC TGTTCAACGT TGTGTATCCG GGACAAAAGG CGATGAGACC 
GGAGGACGTA GCGGTGGCGG TGAGGTTGTA TTGTGCGGAA GCTGTGCGCA GCGGGATTAC 
GACGATCAAC GAAAACGCCG ATTCGGCCAT CTACCCAGGC AACATCGAGG CCGCGATGGC 
GGTCTATGGT GAGGTGGGTG TGAGGGTCGT CTACGCCCGC ATGTTCTTTG ATCGGATGGA 
CAGGCGCATT CAAGGGTATG TGGACGCCTT GAAGGCTCGC TCTCCCCAAG TCGAACTGTG 
CTCGATCATG GAGGAAACGG CTGTGGCCAA AGATCGGATC ACAGCCCTGT CAGATCAGTA 
TCATGGCACG GCAGGAGGTC GTATATCAGT TTGGCCCGCT CCTGCCACTA CCACGGCGGT 
GACAGTTGAA GGAATGCGAT GGGCACAAGC CTTCGCCCGT GATCGGGCGG TAATGTGGAC 
GCTTCACATG GCGGAGAGCG ATCATGATGA GCGGATTCAT GGGATGAGTC CCGCCGAGTA 
CATGGAGTGT TACGGACTCT TGGATGAGCG TCTGCAGGTC GCGCATTGCG TGTACTTTGA 
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CCGGAAGGAT ATTCGGCTGC TGCACCGCCA CAATGTGAAG GTCGCGTCGC AGG CTGTGAG 1080 

CAATGCCTAC CTCGGCTCAG GGGTGGCCCC CGTGCCAGAG ATGGTGGAGC GCGGCATGGC 1140 

CGTGGGCATT GGAACAGATA ACGGGAATAG TAATGACTCC GTAAACATGA TCGGAGACAT 1200 

GAAGTTTATG GCC CATATTC ACCGCGCGGT GCATCGGGAT GCGGACGTGC TGACCCCAGA 1260 

GAAGATTCTT GAAATGGCGA CGATCGATGG GGCGCGTTCG TTGGGAATGG ACCACGAGAT 132 0 

TGGTTCCATC GAAACCGGCA AGCGCGCGGA CCTTATCCTG CTTGACCTGC GTCACCCTCA 138 0 

GACGACTCCT CACCATCATT TGGCGGCCAC GATCGTGTTT CAGGCTTACG GCAATGAGGT 1440 

GGACACTGTC CTGATTGACG GAAACGTTGT GATGGAGAAC CGCCGCTTGA GCTTTCTTCC 1500 

CCCTGAACGT GAGTTGGCGT TCCTTGAGGA AGCGCAGAGC CGCGCCACAG CTATTTTGCA 1560 

GCGGGCGAAC ATGGTGGCCA ACCCAGCTTG GCGCAGCCTC TAGGAAATGA CGCCGTTGCT 162 0 

GCATCCGCCG CCCCTTGAGG AAATCGCTGC CATCTTGGCG CAGCTCGGAT TGGG 1674 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Gin Thr Leu Ser lie Gin His Gly Thr Leu Val Thr Met Asp Gin 
15 10 15 

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gin Asp Gly Arg 
20 25 30 

lie Val Ala Leu Gly Val His Ala Lys Ser Val Pro Pro Pro Ala Asp 
35 40 45 

Arg Val He Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe He Asn 
50 55 60 

Ala His Thr His Val Asn Gin He Leu Leu Arg Gly Gly Pro Ser His 
65 70 75 80 



Gly Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gin 
85 90 95 



t 



60 

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys 
100 105 HO 

Ala Glu Ala Val Arg Ser Gly He Thr Thr He Asn Glu Asn Ala Asp 
115 120 125 

Ser Ala He Tyr Pro Gly Asn He Glu Ala Ala Met Ala Val Tyr Gly 
130 135 140 

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met 
145 150 155 160 

Asp Gly Arg He Gin Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro 
165 170 175 

Gin Val Glu Leu Cys Ser He Met Glu Glu Thr Ala Val Ala Lys Asp 
180 185 190 

Arg He Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly Gly Arg 
195 200 205 

He Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu 
210 215 220 

Gly Met Arg Trp Ala Gin Ala Phe Ala Arg Asp Arg Ala Val Met Trp 
225 230 235 240 

Thr Leu His Met Ala Glu Ser Asp His Asp Gly Arg He His Gly Met 
245 250 255 

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu 
260 265 270 

Gin Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu 

275 280 285 

His Arg His Asn Val Lys Val Ala Ser Gin Val Val Ser Asn Ala Tyr 
290 295 300 

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met 
305 310 315 320 

Ala Val Gly He Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn 
325 330 335 

Met He Gly Asp Met Lys Phe Met Ala His He His Arg Ala Val His 
340 345 350 

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys He Leu Glu Met Ala Thr 
355 360 365 

He Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu He Gly Ser He 
370 375 380 

Glu Thr Gly Lys Arg Ala Asp Leu He Leu Leu Asp Leu Arg His Pro 
385 390 395 400 
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Gin Thr Thr Pro His His His Leu Ala Ala Thr lie Val Phe Gin Ala 
405 410 415 

Tyr Gly Asn Glu Val Asp Thr Val Leu lie Asp Gly Asn Val Val Met 
420 425 430 

Glu Asn Arg Cys Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe 
435 440 445 

Leu Glu Gly Ala Gin Ser Arg Ala Thr Ala lie Leu Gin Arg Ala Asn 
450 455 460 

Met Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu 
465 470 475 480 

His Pro Pro Pro Leu Glu Glu He Ala Ala He Leu Ala Arg Leu Gly 
485 490 495 



INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DEI 

Met Gin Thr Leu 
1 

Tyr Arg Arg Val 
20 

He Val Ala Leu 
35 

Arg Val He Asp 
50 

Ala His Thr His 
65 

Gly Arg Gin Phe 



Lys Ala Met Arg 
100 



ICRIPTION: SEQ II 

Ser He Gin His 
5 

Leu Gly Asp Ser 



Gly Val His Ala 
40 

Ala Arg Gly Lys 
55 

Val Asn Gin He 
70 

Tyr Asp Trp Leu 
85 

Pro Glu Asp Val 



> NO: 23 : 

Gly Thr Leu Val 
10 

Trp Val His Val 
25 

Glu Ser Val Pro 



Val Val Leu Pro 
60 

Leu Leu Arg Gly 
75 

Phe Asn Val Val 
90 

Ala Val Ala Val 
105 



Thr Met Asp Gin 
15 

Gin Asp Gly Arg 
30 

Pro Pro Ala Asp 
45 

Gly Phe He Asn 



Gly Pro Ser His 
80 

Tyr Pro Gly Gin 
95 

Arg Leu Tyr Cys 
110 




Ala Glu Ala Val Arg 
115 

Ser Ala lie Tyr Pro 
130 

Glu Val Gly Val Arg 
145 

Asp Gly Arg lie Gin 
165 

Gin Val Glu Leu Cys 
180 

Arg lie Thr Ala Leu 
195 

lie Ser Val Trp Pro 
210 

Gly Met Arg Trp Ala 
225 

Thr Leu His Met Ala 
245 

Ser Pro Ala Glu Tyr 
260 

Gin Val Ala His Cys 
275 

His Arg His Asn Val 
290 

Leu Gly Ser Gly Val 
305 

Ala Met Gly lie Gly 
325 

Met lie Gly Asp Met 
340 

Arg Asp Ala Asp Val 
355 

He Asp Gly Ala Arg 
370 



62 



Ser Gly He Thr Thr He 
120 

Gly Asn He Glu Ala Ala 
135 

Val Val Tyr Ala Arg Met 
150 155 

Gly Tyr Val Asp Ala Leu 
170 

Ser He Met Glu Glu Thr 
185 

Ser Asp Gin Tyr His Gly 
200 

Ala Pro Ala Thr Thr Thr 
215 

Gin Ala Phe Ala Arg Asp 
230 235 

Glu Ser Asp His Asp Glu 
250 

Met Glu Cys His Gly Leu 
265 

Val Tyr Phe Asp Arg Lys 
280 

Lys Val Ala Ser Gin Val 
295 

Ala Pro Val Pro Glu Met 
310 315 

Thr Asp Asn Gly Asn Ser 
330 

Lys Phe Met Ala His He 
345 

Leu Thr Pro Glu Lys He 
360 

Ser Leu Gly Met Asp His 
375 



Asn Glu Asn Ala Asp 
125 

Met Ala Val Tyr Gly 
140 

Phe Phe Asp Arg Met 
160 

Lys Ala Arg Ser Pro 
175 

Ala Val Ala Lys Asp 
190 

Thr Ala Gly Gly Arg 
205 

Ala Val Thr Val Glu 
220 

Arg Ala Val Met Trp 
240 

Arg He His Gly Met 
255 

Leu Asp Glu Arg Leu 
270 

Asp Val Arg Leu Leu 
285 

Val Ser Asn Ala Tyr 
300 

Val Glu Arg Gly Met 
320 

Asn Asp Ser Val Asn 
335 

His Arg Ala Val His 
350 

Leu Glu Met Ala Thr 
365 

Glu He Gly Ser He 
380 



Glu Thr Gly Lys Arg Ala Asp Leu He Leu Leu Asp Leu Arg His Pro 
385 390 395 400 



■'"S 
..J 
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Gin Thr Thr Pro His His His Leu Ala Ala Thr He Val Phe Gin Ala 



405 



410 415 



Tyr Gly Asn Glu Val Asp Thr Val Leu He Asp Gly Asn Val Val Met 



Glu Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe 
435 440 445 

Leu Glu Glu Ala Gin Ser Arg Ala Thr Ala He Leu Gin Arg Ala Asn 
450 455 460 

Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu 



Met 



470 



475 



465 

His Pro Pro Pro Leu Glu Glu He Ala Ala He Leu Ala Arg Leu Gly 
485 490 495 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Gin Thr Leu Ser lie Gin His Gly Thr Leu Val Thr Met Asp Gin 



5 « 



Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gin Asp Gly Arg 
. 20 25 

lie Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp 

35 40 45 

Arg Val He Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe He Asn 



50 



55 



Ala His Thr His Val Asn Gin He Leu Leu Arg Gly Gly Pro Ser His 



65 



70 



75 



Gly Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gin 



85 9° 



Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys 



100 



105 



64 



Ala Glu Ala Val Arg Ser Gly lie Thr Thr lie Asn Glu Asn Ala Asp 
115 120 125 

Ser Ala lie Tyr Pro Gly Asn lie Glu Ala Ala Met Ala Val Tyr Gly 
130 135 140 

Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met 
145 150 155 160 

Asp Gly Arg lie Gin Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro 
165 170 175 

Gin Val Glu Leu Cys Ser lie Met Glu Glu Thr Ala Val Ala Lys Asp 
180 185 190 

Arg lie Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly Gly Arg 
195 200 205 

lie Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu 
210 215 220 

Gly Met Arg Trp Ala Gin Ala Phe Ala Arg Asp Arg Ala Val Met Trp 
225 230 235 240 

Thr Leu His Met Ala Glu Ser Asp His Asp Glu Arg lie His Gly Met 
245 250 255 

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu 
260 265 270 

Gin Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu 
275 280 285 

His Arg His Asn Val Lys Val Ala Ser Gin Val Val Ser Asn Ala Tyr 

290 295 300 

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met 
305 310 315 320 

Ala Val Gly lie Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn 
325 330 335 

Met lie Gly Asp Met Lys Phe Met Ala His lie His Arg Ala Val His 
340 345 350 

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys lie Leu Glu Met Ala Thr 
355 360 365 

lie Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu lie Gly Ser lie 
370 375 380 

Glu Thr Gly Lys Arg Ala Asp Leu lie Leu Leu Asp Leu Arg His Pro 
385 390 395 400 



Gin Thr Thr Pro His His His Leu Ala Ala Thr lie Val Phe Gin Ala 
405 410 - 415 



65 

Tyr Gly Asn Glu Val Asp Thr Val Leu lie Asp Gly Asn Val Val Met 
420 425 430 

Glu Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe 
435 440 445 

Leu Glu Glu Ala Gin Ser Arg Ala Thr Ala He Leu Gin Arg Ala Asn 
450 455 460 

Met Val Ala Asn Pro Ala Trp Arg Ser Leu Glu Met Thr Pro Leu Leu 
465 470 475 480 

His Pro Leu Pro Leu Glu Glu He Ala Ala He Leu Ala Arg Leu Gly 
485 490 495 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Gin Thr Leu Ser He Gin His Gly Thr Leu Val Thr Met Asp Gin 
15 10 15 

Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gin Asp Gly Arg 
20 25 30 

He Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp 
35 40 45 

Gin Val He Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe He Asn 
50 55 60 

Ala His Thr His Val Asn Gin He Leu Leu Arg Gly Gly Pro Ser His 
65 70 75 80 

Gly Arg Gin Phe His Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gin 
85 90 95 

Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys 
100 105 110 

Ala Glu Ala Val Arg Ser Gly He Thr Thr He Ash Glu Asn Ala Asp 
115 120 125 



Ser Ala He Tyr Pro 
130 

Glu Val Gly Val Arg 
145 

Asp Gly Arg He Gin 
165 

Gin Val Glu Leu Cys 
180 

Arg He Thr Ala Leu 
195 

He Ser Val Trp Pro 
210 
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Gly Asn He Glu Ala Ala 
135 

Val Val Tyr Ala Arg Met 
150 155 

Gly Tyr Val Asp Ala Leu 
170 

Ser He Met Glu Glu Thr 
185 

Ser Asp Gin Tyr His Gly 
200 

Ala Pro Ala Thr Thr Thr 
215 



Met Ala Val Tyr Gly. 
140 

Phe Phe Asp Arg Met 
160 

Lys Ala Arg Ser Pro 
175 

Ala Val Ala Lys Asp 
190 

Thr Ala Gly Gly Arg 
205 

Ala Val Thr Val Glu 
220 



Gly Met Arg Trp Ala Gin Ala Phe Ala Arg Asp Arg Ala Val Met Trp 
]% 225 230 235 240 

^ Thr Leu His Met Ala Glu Ser Asp His Asp Gly Arg He His Gly Met 

!^ 245 250 255 

;^ Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu 

jf*! 260 265 270 

Gin Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp Val Arg Leu Leu 

l *3 275 280 285 

in 

*tl His Arg His Asn Val Lys Val Ala Ser Gin Val Val Ser Asn Ala Tyr 

=fl 290 295 300 

sssf 

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met 

305 310 315 320 



Ala Val Gly He 



Met He Gly Asp 
340 

Arg Asp Ala Asp 
355 

He Asp Gly Ala 
370 

Glu Thr Gly Lys 
385 

Gin Thr Thr Pro 



Tyr Gly Asn Glu 
420 



Gly Thr Asp Asn 
325 

Met Lys Phe Met 



Val Leu Thr Pro 
360 

Arg Ser Leu Gly 
375 

Arg Ala Asp Leu 
390 

His His His Leu 
405 

Val Asp Thr Val 



Gly Asn Ser Asn 
330 

Ala His He His 
345 

Glu Lys He Leu 



Met Asp His Glu 
380 

He Leu Leu Asp 
395 

Ala Ala Thr He 
410 

Leu He Asp Gly 
425 



Asp Ser Val Asn 
335 

Arg Ala Val His 
350 

Glu Met Ala Thr 
365 

He Gly Ser He 



Leu Arg His Pro 
400 

Val Phe Gin Ala 
415 

Asn Val Val Met 
430 



Glu Asn Arg Arg Leu Ser Phe Leu 
435 440 

Leu Glu Glu Ala Gin Ser Arg Ala 
450 455 

Met Val Ala Asn Pro Ala Trp Arg 
465 470 

His Pro Pro Pro Leu Glu Glu lie 
485 



67 

Pro Pro Glu Arg Glu Leu Ala Phe 
445 

Thr Ala lie Leu Gin Arg Ala Asn 
460 

Ser Leu Glu Met Thr Pro Leu Leu 
475 480 

Ala Ala lie Leu Ala Arg Leu Gly 
490 495 



INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



Met Gin Thr Leu 
1 

Tyr Arg Arg Val 
20 

lie Val Ala Leu 
35 

Arg Val lie Asp 
50 

Ala His Thr His 
65 

Gly Arg Gin Phe 



Lys Ala Met Arg 
100 

Ala Glu Ala Val 
115 

Ser Ala lie Tyr 
130 



Ser lie Gin His 
5 

Leu Gly Asp Ser 



Gly Val His Ala 
40 

Ala Arg Gly Lys 
55 

Val Asn Gin He 
70 

Tyr Asp Trp Leu 
85 

Pro Glu Asp Val 



Arg Ser Gly He 
120 

Pro Gly Asn He 
135 



Gly Thr Leu Val 
10 

Trp Val His Val 
25 

Glu Ser Val Pro 



Val Val Leu Pro 
60 

Leu Leu Arg Gly 
75 

Phe Asn Val Val 
90 

Ala Val Ala Val 
105 

Thr Thr He Asn 



Glu Ala Ala Met 
140 



Thr Met Asp Gin 
15 

Gin Asp Gly Arg 
30 

Pro Pro Ala Asp 
45 

Gly Phe He Asn 



Gly Pro Ser His 
80 

Tyr Pro Gly Gin 
95 

Arg Leu Tyr Cys 
110 

Glu Asn Ala Asp 
125 

Ala Val Tyr Gly 
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Glu Val Gly Val Arg Val Val Tyr Ala Arg Met Phe Phe Asp Arg Met 
145 150 155 160 

Asp Arg Arg lie Gin Gly Tyr Val Asp Ala Leu Lys Ala Arg Ser Pro 
165 170 175 

Gin Val Glu Leu Cys Ser lie Met Glu Glu Thr Ala Val Ala Lys Asp 
180 185 190 

Arg lie Thr Ala Leu Ser Asp Gin Tyr His Gly Thr Ala Gly Gly Arg 
195 200 205 

He Ser Val Trp Pro Ala Pro Ala Thr Thr Thr Ala Val Thr Val Glu 
210 215 220 

Gly Met Arg Trp Ala Gin Ala Phe Ala Arg Asp Arg Ala Val Met Trp 
225 230 235 240 

Thr Leu His Met Ala Glu Ser Asp His Asp Glu Arg He His Gly Met 
245 250 255 

Ser Pro Ala Glu Tyr Met Glu Cys Tyr Gly Leu Leu Asp Glu Arg Leu 
260 265 270 

Gin Val Ala His Cys Val Tyr Phe Asp Arg Lys Asp He Arg Leu Leu 
275 280 285 

His Arg His Asn Val Lys Val Ala Ser Gin Ala Val Ser Asn Ala Tyr 
290 295 300 

Leu Gly Ser Gly Val Ala Pro Val Pro Glu Met Val Glu Arg Gly Met 
305 310 315 320 

Ala Val Gly He Gly Thr Asp Asn Gly Asn Ser Asn Asp Ser Val Asn 

325 330 335 

Met He Gly Asp Met Lys Phe Met Ala His He His Arg Ala Val His 
340 345 350 

Arg Asp Ala Asp Val Leu Thr Pro Glu Lys He Leu Glu Met Ala Thr 
355 360 365 

He Asp Gly Ala Arg Ser Leu Gly Met Asp His Glu He Gly Ser He 
370 375 380 

Glu Thr Gly Lys Arg Ala Asp Leu He Leu Leu Asp Leu Arg His Pro 
385 390 395 400 

Gin Thr Thr Pro His His His Leu Ala Ala Thr He Val Phe Gin Ala 
405 410 415 

Tyr Gly Asn Glu Val Asp Thr Val Leu He Asp Gly Asn Val Val Met 
420 425 430 



Glu Asn Arg Arg Leu Ser Phe Leu Pro Pro Glu Arg Glu Leu Ala Phe 
435 440 445 



Leu Glu Glu Ala Gin Ser Arg Ala 
450 455 

Met Val Ala Asn Pro Ala Trp Arg 



His Pro Pro Pro Leu Glu Glu lie 
485 



69 

Thr Ala He Leu Gin Arg Ala Asn 
460 

Ser Leu Glu Met Thr Pro Leu Leu 
475 480 

Ala Ala He Leu Ala Gin Leu Gly 
490 495 



